CLANS: a Java application for visualizing protein families based on pairwise similarity
نویسندگان
چکیده
SUMMARY The main source of hypotheses on the structure and function of new proteins is their homology to proteins with known properties. Homologous relationships are typically established through sequence similarity searches, multiple alignments and phylogenetic reconstruction. In cases where the number of potential relationships is large, for example in P-loop NTPases with many thousands of members, alignments and phylogenies become computationally demanding, accumulate errors and lose resolution. In search of a better way to analyze relationships in large sequence datasets we have developed a Java application, CLANS (CLuster ANalysis of Sequences), which uses a version of the Fruchterman-Reingold graph layout algorithm to visualize pairwise sequence similarities in either two-dimensional or three-dimensional space. AVAILABILITY CLANS can be downloaded at http://protevo.eb.tuebingen.mpg.de/download.
منابع مشابه
Determination of genetic uniformity in transgenic cotton plants using DNA markers (RAPD and ISSR) and SDS-PAGE
One concern about using transgenic plants is the genetic variation that occurred from theirs tissue culture and regeneration. Molecular markers are an important element for efficient and effective determination of genetic variation. The present work was carried out to assess the genetic uniformity of transgenic cottons (Bt and chitinase lines), using RAPD, ISSR molecular markers and SDS-PAGE an...
متن کاملA history of Floral diversity (pollen, spores and algal) during the latest Holocene in the Bandung basin based on palynological analysis in Cihideung, West Java, Indonesia
Floral diversity is a measure of number of type flora in an area, and reflects how vegetation develops in response to the environmental condition during a certain time interval. The present study aims to examine changes in the diversity of vegetation (pollen, spores and algae), evenness, and similarity in the Bandung Basin through a core of 240 cm depth using a ground drill, as well as the ...
متن کاملAnalyzing microarray data using CLANS
UNLABELLED Analysis of microarray experiments is complicated by the huge amount of data involved. Searching for groups of co-expressed genes is akin to searching for protein families in a database as, in both cases, small subsets of genes with similar features are to be found within vast quantities of data. CLANS was originally developed to find protein families in large sets of amino acid sequ...
متن کاملProteins comparison through probabilistic optimal structure local alignment
Multiple local structure comparison helps to identify common structural motifs or conserved binding sites in 3D structures in distantly related proteins. Since there is no best way to compare structures and evaluate the alignment, a wide variety of techniques and different similarity scoring schemes have been proposed. Existing algorithms usually compute the best superposition of two structures...
متن کاملClustering Rfam 10.1: Clans, Families, and Classes
The Rfam database contains information about non-coding RNAs emphasizing their secondary structures and organizing them into families of homologous RNA genes or functional RNA elements. Recently, a higher order organization of Rfam in terms of the so-called clans was proposed along with its "decimal release". In this proposition, some of the families have been assigned to clans based on experim...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 20 18 شماره
صفحات -
تاریخ انتشار 2004